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The Circular Electron Positron Collider (CEPC) is designed to precisely measure the properties of the Higgs 
boson, study electroweak interactions at the Z-boson peak, and search for new physics beyond the Standard 
Model. As a component of the 4*” Conceptual CEPC detector, the drift chamber facilitates the measurement 
of charged particles. This study implemented a Geant4-based simulation and track reconstruction for the drift 
chamber. For the simulation, detector construction and response were implemented and added to the CEPC sim- 
ulation chain. The development of track reconstruction involved track finding using the Combinatorial Kalman 
Filter method and track fitting using the tool of GenFit. Using the simulated data, the tracking performance 
was studied. The results showed that both the reconstruction resolution and tracking efficiency satisfied the 
requirements of the CEPC experiment. 


Keywords: CEPC, Drift chamber, Detector simulation, Track reconstruction 


I. INTRODUCTION 35 parameters, such as the magnetic field and materials, are con- 


In 2012, the discovery of the Higgs boson by the ATLAS 
[1] and CMS [2] experiments at the Large Hadron Collider 
(LHC) [3] confirmed the predictions of the Standard Model 
(SM) [4] with tremendous success. However, the Standard 
Model does not adequately explain fundamental physical phe- 
nomena such as dark matter, dark energy, and neutrino oscil- 
lations. With the distinct properties predicted in different the- 
oretical models, the Higgs boson has become a critical piece 
of the puzzle in the search for physics beyond the Standard 
Model (BSM) [5, 6]. There are several future collider exper- 
iments, including future circular collider (FCC) [7], Circular 
Electron Positron Collider (CEPC) [8], International Linear 
Collider (ILC) [9], and Compact Linear Collider (CLIC) [10], 
which aim to precisely measure the properties of the Higgs 
boson [11-14]. The CEPC is a 100 km circular electron- 
positron collider proposed by Chinese physicists. It has been 
designed to provide electron-positron collisions at a central 
mass energy of approximately 240 GeV for the Higgs boson 
study through the ete~ — ZH process, as well as collisions 
at the Z-boson peak for precise electroweak physics measure- 
ments. 

The abundant physics research programs at CEPC impose 
stringent requirements on detector performance. The detailed 
performance requirements can be found in the CEPC Con- 
ceptual Design Report (CEPC CDR) [8] published in October 
2018. For the tracker, a tracking efficiency of better than 99% 
is required for charged particles with a transverse momentum 
greater than 1 GeV, and momentum resolutions of the recon- 
structed tracks should be achieved per mille level. 

The 4” conceptual detector is another option for detec- 
tor design, which was proposed based on the CEPC CDR. 
The first three detector designs do not include a drift cham- 
ber, which is an innovative detector design. Other important 
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sistent with the CEPC CDR and will be updated based on per- 
formance testing and future research. As shown in Figure 1, 
the detector design comprises seven subdetectors. From the 
innermost to outermost radii, they comprise the VerteX De- 
tector (VXD), Silicon Internal Tracker (SIT), Drift Chamber 
(DC), Silicon External Tracker (SET), Transverse Crystal- 
bar Electromagnetic Calorimeter (ECAL), Scintillator Glass 
Hadronic Calorimeter (HCAL), and Muon Tracker. The coil 
of a 3 Tesla superconducting magnet is located outside the 
ECAL, and a flux return yoke is embedded in the Muon 
Tracker. One of the characteristics of this detector is the com- 
bination of silicon trackers with a multilayer drift chamber 
to achieve a higher performance for both tracking and parti- 
cle identification. For this detector design, the DC functions 
as a tracking detector that provides an accurate measurement 
of the position of a charged particle [15-17] and is applied 
to identify the particle according to the number of ionization 
clusters lying on the particle track [18]. 

Both the detector design and potential physics studies re- 
quire strong support from detector simulation and event re- 
construction. Therefore, software development for DC simu- 
lation and track reconstruction is critical for CEPC Research 
and Development (CEPC R&D). This study elucidates the 
details of the implementation of the DC software and is or- 
ganized as follows. Section II provides an overview of the 
CEPC software and the workflow of offline data processing. 
In Section III, the DC simulation is introduced, including a 
geometry description[19], detector construction, and digitiza- 
tion [20]. Then, Section IV describes the implementation of 
track reconstruction, including track finding based on a Com- 
binatorial Kalman Filter(CKF) [21, 22] and track fitting using 
GenFit [23]. Finally, the physics performance is presented in 
Section V, followed by a conclusion in Section VI. 


Il. SOFTWARE OVERVIEW 


CEPC Software (CEPCSW) [24] is an offline software sys- 
tem developed to support data processing and analysis in the 
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Fig. 1. Schematic of the CEPC 4‘” conceptual detector. 


72 CEPC experiment. The components in the data processing s3 


73 chain include physics generators, Geant4-based detector sim- a 
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7s lation, machine learning interfaces, and event reconstruction. se 
7s The current software architecture of CEPCSW is illustrated in 87 
77 Figure 2. All software tools and interfaces in the figure were ss 
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The CEPCSW is fully integrated with Key4hep [27], a 116 


common software stack developed for future high-energy 
physics (HEP) experiments. The primary parts of the 
CEPCSW core software include GeomSvc, k4FWCore, 
EDM4hep [28], and Gaudi [29] framework. GeomSve is a 
service through which tracking algorithms can access the de- 
tector geometry from DD4hep[30]._ Edm4hep functions as 
an event data model; however, it also provides bidirectional 
conversion between persistent data and data objects in mem- 
ory, which are handled by k4FWcore. Gaudi is the underlying 
software framework responsible for defining the interfaces for 
all software components, managing data objects in memory, 
and controlling the execution of applications. 


The data processing flow of single-particle events is shown 
in Figure 3 and comprises four steps. Event generation pro- 
duces a list of particles, each of which is generated by a sin- 
gle interaction with a vertex located at the geometric origin. 
In the next step, these generated events are passed into the 
simulation, where each particle is propagated through the de- 
tector using Geant4. During this process, the simulated inter- 
actions between the particles and the detector are recorded. In 
the digitization step, the responses of the elementary detector 
modules are modelled. In addition to Monte Carlo (MC) hits 
from signal events, digitization also accepts hits from back- 
ground events as input. In the final step, the reconstruction 
reads the charge and/or time information and generates tracks 
and showers for the tracking detector and calorimeter, respec- 
tively. 

As mentioned above, EDM4hep is adopted as the event 
data model, which describes the event data generated at each 
step as well as the relationships between two successive steps. 
Additional helper classes are required to facilitate the rapid 
navigation between a generated event and its corresponding 
reconstructed event. 


The detector is described using DD4hep to ensure that 


11 


Ex} 


11 


© 


11 


o 


12 


ò 


12 


122 


123 
124 


125 


126 
12 


N 


12 


© 


12: 


© 


131 


© 


13 


13: 


i} 


133 
134 


13: 


a 


13 


a 


13 


s 


13 


© 


13: 


© 


14 


D 


14 


142 


143 


Event generator 


Detector Simulation 
edm4hep:: 
SimTrackerHit 
edm4hep:: 
TrackerHit 
edm4hep:: 

Track 


Fig. 3. Data processing flow. The blue rectangles represent data pro- 
cessing algorithms, while the green ellipses define the data objects 
produced by the last step and used by the followed step. 


an identical detector geometry is used for different applica- 
tions in the workflow. The core element is based on ROOT 
TGeo[31] with various types of extensions to provide a con- 
sistent detector description for simulation, reconstruction, 
and analysis. DDG4 is used for detector simulation to convert 
the TGeo-based geometry to the Geant4 geometry, whereas 
DDRec is employed to provide higher-level detector geome- 
try, for example, information for subdetector components. 


I. DRIFT CHAMBER SIMULATION 


The detector simulation relies on the Geant4 simulation 
toolkit, which provides physics models and realizes particle 
transportation through geometry. In the CEPCSW, the Geant4 
run manager is wrapped with a Gaudi service, facilitating the 
Gaudi framework in controlling the event loop of the simu- 
lation. This service is responsible for initializing the geome- 
try, physics lists, and user actions of Geant4. It also provides 
standard user interfaces for interacting with Geant4. Because 
of the simulation service, only the detector geometry and re- 
sponse must be implemented for a specific detector. A pre- 
cise description of the detector geometry should contain ex- 
act knowledge of the position, shape, dimension, and mate- 
rial content of every detector component. The digitization 
algorithm is invoked when the energy deposited in the sensi- 
tive regions of the detector increases above a pre-configured 
threshold within a particular time window. The digitization 
output may have provided a detailed signal shape during this 
period. 
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A. Detector geometry 


The DC is surrounded by silicon detectors in both the barrel 
and end-cap regions, covering the radial range of 800—1,800 
mm and the Z range of +2,980 mm. The detailed parameters 
of the DC are listed in Table 1. 


Table 1. Baseline configuration of the drift chamber. 


Geometry Parameters Value 

Half length 2980 mm 

Inner and outer radius 800 mm ~ 1800 mm 
Number of layers 55 

Cell size 18mm x 18mm 
Gas 90%He+10%C4H10 
Single wire resolution 110 pm 

Sense to field wire ratio 1:8 

Total number of sense wire 25,357 


Stereo angle 0.028 rad~0.062 rad 


Sense wire Gold plated Tungsten ¢ = 20 wm 
Field wire Silver plated Aluminum ¢ = 40 um 
Wall Carbon fiber 0.2 mm(inner) and 2.8 mm(outer) 


A small cell design is selected to obtain a sufficient number 
of track hits at the outer radius. The ratio of the half-height 
to the half-width (the distance between a sense wire and its 
neighboring field wire in the r — ¢ plane) is approximately 
1; thus, the shape of the drift cell is almost square. The drift 
cell contains a sense wire surrounded by eight field wires. In 
total, there are 25,357 drift cells of size 18 mm x 18mm. To 
solve the ambiguity [32] caused by multiple hits in the same 
layer, the drift cells between neighboring layers are offset by 
the half-width. 

Because stereo wires have technical advantages in measur- 
ing the positions of charged tracks in the Z-direction, the DC 
is composed purely of stereo wires. All stereo wires are orga- 
nized into 55 coaxial layers. The stereo angle on each layer 
varies as 0.028—0.062 rad, and the wires on any two neigh- 
boring layers are tilted in opposite directions. 

Materials with low densities and atomic numbers are pri- 
marily considered to minimize multiple scattering. Both the 
inner and outer cylinders are made of carbon fibers. The 
working gas for the DC is a mixture of helium and C4H10 
at a ratio of 90:10. The sense wire is made of gold-plated 
tungsten with a diameter of 20 um, and the field wire is made 
of silver-plated aluminum with a diameter of 40 um. 

The DC is described using DD4hep, and the detector 
geometry parameters, such as dimensions, materials, and 
sensitive detectors, are stored in XML(eXtensible Markup 
Language)[33] format. XML serves as a compact file for 
DD4hep and the syntactic structure of the compact XML de- 
scription is obtained from the SiD[34] detector description. 
Therefore, the detector design is associated with a set of cor- 
responding XML files. By versioning the changes in the set 
of XML files, the geometric version can be easily controlled. 

In the simulation, the DC is constructed by parsing the geo- 
metric parameters from the XML files using a specialized de- 
tector constructor, creating individual components, and plac- 
ing them accurately in the correct positions to form a com- 


187 plete virtual detector. The construction process is as fol- 
188 lows. A cylindrical-shaped volume is created according to the 
189 Maximum and minimum radii and length of the DC. Subse- 
190 quently, within the holy volume, wire layers with thicknesses 
191 Of 18mm are created at different radii, each of which is rep- 
192 resented by a hollow cylinder. For a specific layer, the sense 
193 and field wires are directly constructed as tubs and placed at 
194 the right locations. The field wires are shared between neigh- 
195 boring cells with each field wire assigned to one drift cell as 
19 an entire tube. Because of the large number of wires, if ev- 
197 ery drift cell is created independently, it deteriorates the per- 
198 formance in terms of both speed and memory. To solve this 
199 problem, an alternative method is implemented to first con- 
2% Struct the layers and then divide each layer into drift cells by 
21 utilizing DDSegmentation[30] from DD4hep. 


(b) 


Fig. 4. (a) Projection of the first 10 layers of the DC in the r — 
ġ plane. The red dots represent sense wires, while the green dots 
represent field wires. (b) Visualization of sense wires in the first 
layer. The skew is exaggerated. 


After the geometrical model for the DC is built, the Geant4 
203 Visualization tool can be used to produce graphical represen- 
204 tations of the detector and draw views and sections of the de- 
20s tector. Figure 4(a) shows a visualization of the sense and field 
206 Wires created in the simulation, illustrating the r — ¢ projec- 
207 tion of the proportion of the first ten layers of wires. The 
20s Sense wires of each layer forms a rotating hyperboloid sur- 
20% face and Figure 4(b) shows the sense wires in the first layer. 
210 With the help of the visualization tool, the parameters of all 
21 individual geometrical entities can be displayed and checked. 
212 Careful validation of the geometry lays a solid foundation for 
213 the next steps in digitization and tracking. 


202 


214 B. Digitization 

The DC is used to measure the spatial coordinates of the 
216 trajectory of a charged particle. This is achieved by detecting 
217 the ionization electrons produced by the charged particles in 
218 the gas of the DC and measuring their drift time and arrival 
219 positions on the sense wires. 

In the context of detector simulation, digitization refers to 
221 the simulation of the detector response. For the DC, an ac- 
222 curate simulation of the drift time is particularly important. 
223 Because the CEPC detector R&D is in its early stages, a de- 
22⁄4 tailed detector design is still not available. A simplified dig- 
226 itization method is implemented to support the development 
26 Of the tracking algorithm. Its workflow is illustrated in Fig- 
227 ure 5(a). 

In the simulation using Geant4, a small step size i.e 0.5 
229 mm) is selected to simulate the passage of charged particles 
230 through the DC. When the particle enters the drift cell, the 
231 distance between each Geant4 step and the sense wire of the 
232 Cell is recorded. The smallest distance, referred to as doca 
233 (the distance of closest approach), represents the closest ap- 
234 proach of the particle trajectory to the sense wire as shown in 
235 Figure 5(b), and is considered as the drift distance of this hit. 
236 The doca is smeared using a Gaussian function with a width 
237 equivalent to the wire resolution set to 110 um. The drift time 
233 is calculated using the space-time relation (X-T relation) [35]. 
239 In this study, a relatively simple linear X-T relationship is 
240 used. 

During digitization, the effects of various types of back- 
242 ground events are considered . The hits from a signal event 
243 are overlaid with those from additional background events be- 
244 fore the detector response was calculated. The background 
24s primarily originates from the minimum bias and beam—gas in- 
24 teractions. In this study, background events are assumed to be 
247 randomly distributed within a time window of 2,000 ns. For 
24s Simplicity, track recognition is only performed on the group 
249 Of the fastest hits, each of which has the minimum drift time 
250 among all hits belonging to a drift cell. Therefore, during the 
251 digitization stage, when a background hit precedes that of a 
252 Signal within the same drift cell, the signal hit is overridden 
253 by the background hit. Furthermore , the default wire effi- 
254 ciency is assumed to be 100%, and all subsequent results are 
255 based on this assumption. 
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IV. TRACK RECONSTRUCTION 


256 


Track reconstruction is among the most crucial data pro- 
258 cessing tasks in HEP experiments because it facilitates high- 
259 precision position and momentum measurements and identi- 
260 fication of charged particles [36]. A tracking algorithm is de- 
21 veloped to implement track reconstruction by combining the 
22 Spatial measurements (hits) of a traversing particle into sets 
263 (tracks) to extract the kinematic properties of the particle. The 
264 process of track reconstruction comprises two primary stages: 
25 track finding and track fitting, which complement each other 
266 to provide the most accurate track parameters. 

At present, the track reconstruction for the DC is based on 
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Fig. 5. (a) The specific workflow of digitization. (b) Schematic 
diagram of the doca. 


CKF and it requires seed tracks provided by the silicon track- 
ers. Because silicon trackers can offer high-precision mea- 
surements with high efficiency, using silicon tracks as seeds 
fully meets the requirements raised by the development of the 
DC track reconstruction algorithm. We plan to develop a ded- 
icated seed-finding algorithm for the DC track reconstruction. 
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A. Track parameterization 


In a homogeneous magnetic field, the motion of charged 
particles in the DC forms a trajectory that is described by the 
five parameters of a helix, as shown in Equation 1. The first 
three parameters of the equation describe the circle on the X- 
Y plane, as shown in Figure 6, and the last two parameters 
describe the information along the Z direction of the trajec- 
tory, following the conventions used by the BaBar experiment 
[37]. 


P = (do, bo, w, zo, tan A)T. (1) 


The definition of each parameter is as follows: 


e dg is the signed distance from the center of the pro- 
jected circle of the helix to the reference point on the 
X-Y plane. Here, the interaction point (IP) is consid- 
ered as the reference point. 


e ġo is the azimuth angle of the center of the projected 
circle of the helix relative to the IP on the X-Y plane. 


e w is the curvature of the trajectory in the X-Y plane. 
Further, the sign represents the charge of the trajectory 
assigned by track fitting. 


e z is the signed distance between the helix and the IP 
along the Z direction. 


e tanA is the slope of the helix, that is, the inclination 
angle of the projection of the helix on the R-Z plane. 


Because of material effects and magnetic field inhomo- 
geneity, the real trajectory is not a perfect helix, causing these 
five parameters to change along the flight path of the trajec- 
tory. For the tracking algorithms, all DC hits are considered 
as measurement data. In this study, charged particles with 
transverse momentum greater than 0.8 GeV are used. This 
eliminates the possibility of multiple circular tracks, thus al- 
lowing the track to be accurately described by track param- 
eters at the first DC hit. The track parameters are intercon- 
verted with the position and momentum of the hits by calling 
specific functions using the calculation formula referenced in 
[38]. 


B. Extension of geometry interfaces 


The track reconstruction process considers the interaction 
with materials, such as energy loss and multiple scattering, 
which are highly dependent on the geometry, materials, and 
nonuniform magnetic field of the DC. To obtain these infor- 
mation, the following extended interfaces are developed by 
inheriting from GenFit. This because the tracking algorithm 
is based on the GenFit software package and a detailed expla- 
nation of GenFit is presented in Section IV D: 


e CEPCMagneticFieldProvider: This class inherits from 
AbsBField of GenFit, and provides the magnetic 
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Fig. 6. Description of the helix parameters (do, ġo) and the position 
of the POCA (the point of closest approach). 


field strength at specific positions by integrating with 
DD4hep. It accepts the position as input and calculates 
the magnetic field strength from the magnetic field map 
of the detector. 


e CEPCMaterialProvider: This class inherits from Abs- 
MaterialInterface of GenFit, and obtains the geometry 
and material information of the DC based on GeomSvc. 


C. Track finding 


The mission of track-finding is to rapidly and accurately 
determine the potential DC hits produced by the same charged 
track based on candidate seed tracks [39]. DC track finding is 
implemented using three components, as shown in Figure 7: 
finding DC hits with CKF, salvaging DC hits, and appending 
SET hits. 


1. Hits finding 


DC hit finding is among the main procedures in DC track 
finding, and it is implemented by reusing the CKF algorithm 
adopted from Belle II [40]. The CKF is an iterative local algo- 
rithm, first presented in [41], and widely used in HEP experi- 
ments. The CKF starts with a seed estimation of track param- 
eters with uncertainties and then extrapolates the track into 
the detector volume using the Runge-Kutta-Nystrom method 
[42]. 

In CEPCSW, the implementation of the CKF algorithm in- 
volves three main tasks: the acquisition of geometry infor- 
mation for the DC, input/output (I/O) management, and op- 
timization of various parameters. The DC geometry can be 
obtained by implementing classes as introduced in Section 
IV B. In addition, the I/O of the CKF algorithm exists in the 
Belle II data format, thus completing the conversion between 
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Fig. 7. Workflow of track reconstruction. 


EDM4hep and Belle II data model. Because of the differ- 
ent geometrical designs, working gas, and magnetic fields, 
the judgement criteria and standards of the CKF are opti- 
mized, and new thresholds are configured to adapt them to 
the CEPC DC, for example, the extrapolation track length, 
chi-squared(y) value, and doca. 

In this study, five hypotheses are considered: electron, 
muon, pion, kaon, and proton. A schematic and brief 
overview of the track-finding process are illustrated in Fig- 
ure 8. The primary tracking process begins with seed tracks, 
based on which the search roads are built in the DC. These 
seeds are extrapolated along the search roads using the CKF, 
which attempts to smoothen the track while iteratively search- 
ing for neighboring hits both outward and inward. After ex- 
trapolation, potential candidate hits are identified based on the 
drift distance, current position, and uncertainties of the candi- 
date track. When multiple candidate hits are found during the 
propagation step, a prediction is made for each candidate hit 
using the predicted track parameters and measured values to 
assess the next candidate hit. Subsequently, a new candidate 
hit is added to the track and the process is repeated. In cases 
involving multiple mutually exclusive next-candidate hits, the 
entire candidate track is duplicated and subsequently treated 
as two separate tracks. Finally, the final candidate track is se- 
lected based on various quality criteria, such as the residual 
distance between the extrapolated and measured hit positions, 
extrapolated track length, and the doca, etc. 

This process produces a set of potential candidate hits, 
which are then further refined. The advantage of the CKF 
algorithm is its fast running speed. However, it is difficult 
to reliably determine all DC hits because the CKF algorithm 
uses restrictive conditions, leading to only one hit being found 
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hit-salvaging algorithm is developed to solve this problem. 


o Lost DC hit using CKF 

e Sense wire 

= Fitted track 

— Seed track from VXD and SIT 

o Found signal DC hits using CKF 
e SIT hits 


Fig. 8. Schematic of track finding process. For better visibility, the 
schematic shows only a few hits in the DC. First, determining the 
DC hits (the red dots) using CKF is based on the seed track (the 
blue curve), and a rough track (the red curve) is fitted using GenFit. 
Subsequently, the remaining hits are determined using the hits sal- 
vaging algorithm. By extrapolating the track to the remaining hits 
(the green dots), it is determined whether they belong to the track 
based on criteria, such as, the doca, residual of the extrapolated dis- 
tance, etc. 


2. Hits salvaging and appending 


The hit-salvaging algorithm serves as a supplement to the 
hit-finding algorithm and is developed independently to sal- 
vage more signal hits within the DC. This algorithm utilizes 
the preliminary results from the CKF, uses GenFit to perform 
rough track fitting, and extrapolates the track to the DC hits 
lost by the CKF. 

During the examination, the hits are evaluated based on se- 
lection criteria, including the extrapolated track length and 
doca, to determine their association with the current track and 
to add them to the track. Subsequently, by refitting the pre- 
vious track with the retrieved hits using GenFit, more precise 
track parameters can be obtained. These two steps can re- 
move the majority of noise hits to achieve high hit purity. 

Furthermore, the hit salvaging algorithm facilitates the en- 
hancement of the overall accuracy and completeness of the 
track reconstruction process, improving track quality and hit 
efficiency(€;,;,). The hit efficiency is defined as equation 2: 


; N fina 
hit = , 
Nuc 


(2) 


where N fina represents the number of found DC signal hits 
and Nmc represents the number of truth DC hits. Figure 9 
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Fig. 9. Hit efficiency as a function of pr measured using single e7, 
and the pink dots represent the hit efficiency with the hit salvaging 
algorithm, while the teal dots represent the hit efficiency without the 
hit salvaging algorithm. 


Next, the refined track is extrapolated to the SET plane us- 
ing GenFit to determine the expected position of the SET hit. 
If the extrapolated hit position falls within 30 of the actual hit 
position on the SET plane, the SET hit obtained by extrapo- 
lation is added to the track. This step enhances the accuracy 
and reliability of the candidate hits. 


D. Track fitting 


The track-fitting algorithm is performed to accurately de- 
termine the track parameters by fitting all the hits obtained in 
the aforementioned processes and to provide physics quanti- 
ties such as momentum, vertex position, and error matrices of 
the track. Further, this enables the DC to exhibit good track 
resolution and kinematic discrimination capability [43, 44]. 
In the track fitting process, the momentum, vertex position, 
and error matrix of the track are obtained using the method 
of least squares[45]. The least squares method minimizes the 
x? value to determine the parameter estimates for y7,,,,- X? 
is defined as follows. 


?” drift; — doca; 
a) es 
i=l 


2 
a J (3) 


where drift; represents the fitted distance of the it” hit on 
the track, g; represents the measured error of the drift distance 
for the i*” hit, and doca; represents the distance between the 
it” hit sense wire and the fitted track in space, that is, the 
doca. The fitting process involves multiple iterations to adjust 
the track parameters until the fit converges. 

During the track-fitting procedure, the correct handling of 
measurement uncertainties and all physical effects disturbing 
the helical path of the track must be considered to arrive at 


439 a correct description of the track; these effects are highly de- 
440 pendent on the geometry of the detector and the non-uniform 
441 magnetic field. Considering all the physical effects renders 
42 the mathematical description of the track model very com- 
443 plicated and allows only a numerical calculation. Dedicated 
444 fitting algorithms based on GenFit are developed for this pur- 
44s pose because GenFit can accurately determine the track pa- 
44 rameters of charged particles using spatial coordinate mea- 
surements provided by detectors. The GenFit package is inte- 
grated into CEPCSW by extending the necessary interfaces. 


447 


448 


449 1. 


Extension of GenFit interfaces for CEPC 


The configuration of GenFit necessitates a large amount of 
DC information, including the geometry parameters, detector 
materials, strength distribution of the magnetic field, position 
453 Of the sense wires, and drift distance of hits, etc. Further, to 
454 facilitate the integration of GenFit into CEPCSW, seamless 
455 data conversion is necessary. Thus, the following GenFit in- 
ass terface is developed: 


450 
451 


452 


457 e CEPCToGenFitHitConverter: Extracting the necessary 
information based on the TrackerHit object [28] of 
EDM4hep, and creating a corresponding GenfitHit ob- 
ject which is used as the input of track fitting algo- 
rithm. It implements the data model conversion from 


EDMs4hep to GenFit at the level of the hit. 
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459 
460 
461 


462 


463 CEPCToGenFitTrackConverter: Storing the results of 
the track fitting algorithm, including general informa- 
tion (number of iterations, convergence, etc.), fit status 
(x7, NDF, p-value, track length, etc.), and constructing 
a Track object [28] as the output, to achieve data model 


conversion at the level of the track. 
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468 


469 WireMeasurementDC: Inheriting from WireMeasure- 
ment [23], which is used for measurements in wire de- 
tectors (such as straw tubes and drift chambers). To use 
this class, a plane must be described by the u and v axis 
with v coincident with the wire (and u orthogonal to it, 
obviously). This is because it is not valid for arbitrary 


choices of plane orientation. 
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475 


GenfitFitter: Inheriting from AbsKalmanFitter, which 
is used to call the fitter in the GenFit package, and en- 
abling the configuration of the magnetic field, material 
effects, and fitter type. 
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The interfaces for obtaining material and magnetic field 
information are introduced in Section IV B, and Figure 10 
shows the data flow during the track-fitting process. 

After fitting, GenfitTrack objects contain a large amount 
of data: track representations, TrackPoint objects [23] with 
measurements, FitterInfo objects [23] that contain all fitter- 
specific information, and the fitting status used to assess the 
quality of the fitting. In addition, the fitted drift distance for 
each hit is obtained, which is useful for subsequent hit selec- 
tion process. By analyzing this information, the tracks are 
reconstructed with high track efficiency. 
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Fig. 10. Data flow of track fitting process. 


V. TRACKING PERFORMANCE 


To evaluate the performance of the tracking algorithm, 
samples including single-particle events and physics process 
events are generated. Tracks reconstructed using VXD and 
SIT are used as candidate seed tracks, and the performance of 
the tracking algorithm is assessed under various conditions. 


A. Data samples 


Two types of samples are produced to quantify tracking 
performance. The first sample encompasses five types of sin- 
gle particles (e7, uw, n~, K7, and p) generated using the 
particle gun of Geant4. The second sample comprises physics 
events, specifically the ete” — ZH, H — jt 7 process, 
which is generated using the Whizard [46] generator. 

Each single-particle event is generated based on the speci- 
fied polar angle 0 and transverse momentum pr, whereas the 
azimuthal angle ¢ is uniformly distributed from [—7,, 7). The 
barrel components of SIT and SET provide precise hit posi- 
tions inside and outside the DC, improving the reconstruction 
efficiency, particularly for low-momentum charged particles. 
Therefore, single-particle events with cos@ < 0.776 corre- 
spond to the barrel region of the DC with a transverse mo- 
mentum pr in the range of [2, 50] GeV. 

The H — yt p~ process is a clean physics channel with a 
simple final state, which renders it suitable for testing the al- 
gorithm performance. The H — u™ u` events are generated 
by Whizard at leading-order precision, considering the initial 
state radiation and final state radiation processes during event 
generation, as well as the 0.16% energy spread of the electron 


519 beam caused by beam-synchrotron radiation in CEPC. 


520 


B. Quality of track fitting 


To ensure the reliability of track fitting, it is essential to 
522 perform a goodness-of-fit test, which typically includes two 
components: the x? test and reduced residuals (pull) test. The 
x? value is provided by GenfitTrack object after the fitting. 
The reduced residuals (pull) are defined as follows: 
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V fit — Vmeas 


[Fit 


pull = (4) 
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= oe inal 2 

where vrit represents the fitted value obtained from the 
track reconstruction and Vmea is the measured value obtained 
from the simulation in the previous section. In addition, 
Vfit — Vmeas İS the residual between the fitted and measured 
values , and opit and Omeqs denote the standard deviations 
of the fitted and measured values, respectively. The pull dis- 
tribution conforms to a standard normal distribution with a 
good fit, indicating that most of the residuals are approxi- 
mately zero and exhibit a symmetrical shape. The pull test 
is used to examine track momentum, drift distance, and track 
parameters. To calculate the pull, the true value is considered 
with a standard deviation of 0. Consequently, equation (4) is 
simplified to 
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pull _ Prt = P meas 
O fit , 
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54 


where Pri expresses the fitted value and Pmeas denotes 
the measured value. 

The momentum resolution is an important metric in track 
reconstruction. Figure 11 shows the residual distribution be- 
tween the transverse momentum of the fitted and truth tracks 
for single u~ with 10 GeV. The distribution closely follows 
a Gaussian distribution, yielding a momentum resolution of 
14.0 MeV, which satisfies the performance requirements of 
the CEPC tracking system. 

Spatial resolution is a critical standard that represents the 
accuracy of spatial position reconstruction. Figure 12 shows 
the residual distribution between the fitted and drift distances, 
which also exhibits a Gaussian distribution. The spatial res- 
olution achieved is 106 um, which is slightly below the set 
value of 110 um. The method of fitting tracks containing 
measurement points [47], where the construction of x? in the 
least-squares fit includes the contribution of the measurement 
points, can result in an underestimation of the spatial resolu- 
tion compared to the true value in the reconstruction process. 
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560 C. Track parameters resolution 


The main results of the track reconstruction are the five 
track parameters as well as their distribution and resolution, 
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Fig. 11. Blue dots represent the residual distribution between prrec 
and prmc for single y` with 10 GeV. prrec and prmc represent 
the transverse momentum of the fitted track and the true track, re- 
spectively. The orange line represents the fitted Gaussian function. 
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Fig. 12. Blue dots represent the residual distribution between the 
fitted and drift distances for single u` with 10 GeV. The orange 
line represents the fitted Gaussian function. 


which are related to the performance of the physics analy- 
sis. Owing to the influence of multiple scattering, correla- 
tions are observed between the track parameters, with signifi- 
cant correlations among the parameters within the same plane 
and smaller correlations between the parameters of two differ- 
ent planes. Furthermore, the geometric curvature k(= 1/w) 
of the track is related to transverse momentum pr, where 
K = q/pr. Because the reconstructed instances use the same 
charge, the resolution of the transverse momentum pr of the 
reconstructed track indirectly reflects the resolution of «. The 
following resolutions for do, zo, and transverse momentum 
pr are provided to verify the results of the track reconstruc- 
tion. 
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The resolution of the track parameters is obtained by fit- 
ting the distribution of the track parameters residuals using a 
sve Gaussian function. In the top panels of Figure 13, the reso- 
579 lutions of the impact parameters do, zo and the relative reso- 
ss lution of pr are shown as functions of the particle pr. The 
ss1 bottom panels of Figure 13 display the resolutions of the im- 
see pact parameters do, Zo at a fixed transverse momentum pr = 


576 


577 


583 


593 


594 
59! 


a 


591 


o 


597 


59% 


© 


599 


601 


S 


60 


602 


603 
604 


60: 


a 


606 


60 


gS 


60! 


© 


609 
61 


o 


61 


61 


N 


61 


W 


614 


61 


a 


61 


a 


61 


Ex} 


61 


© 


61 


o 


62 


ò 


62 


622 
623 
624 


62! 


a 


62 


a 


62 


` 


628 
629 


631 


S 


631 
632 
633 


63 


A 


8 GeV and the relative resolution of pr as a function of the 
polar angle |cos0|. For each pr and |cos0|, a sample of 20k 
events of single u~ is generated. Owing to the increased ma- 
terial effect, the resolution of do, zo deteriorates at lower pr 
and larger |cos6|. The relative resolution of pr is dependent 
on pr because the curvature of the track on the X-Y plane 
is dominated by the multiple-scattering effect. In the high- 
momentum region, the momentum resolution is dominated 
by the single-point resolution of the tracks, resulting in worse 
resolution with an increase in pr. 


D. Tracking efficiency 


Another important criterion for evaluating the track recon- 
struction performance is the tracking efficiency(¢), which is 
related to the tracking algorithm, particle type, and particle 
momentum. Before analyzing track efficiency, it is essential 
to establish a definition of a good track. A good track is de- 
fined as follows: the x? value is below 400 and the number of 
DC signal hits on the track is more than six. 

The definition of tracking efficiency e is 


Nrec 
7 
Nseed 


e(%) (6) 


where Nseeq is the number of satisfactory seed tracks suc- 
cessfully reconstructed using VXD and SIT, and N,pec is the 
number of reconstructed good-tracks. 

As material effects easily lead to the generation of sec- 
ondary particles, resulting in inconsistencies between the re- 
constructed track using MC particles as the seed and true 
tracks, high-quality track events are selected. When a charged 
particle enters the DC with minimal energy loss, it is con- 
sidered to be a high-quality track, which is evaluated by 
(Pme — Pfirst)/Pme, Where Pme represents the true momen- 
tum and pfirst represents the momentum of the first DC hit. 

Figure 14 shows the track efficiency as a function of pr 
for the five types of single particles, and the track efficiency 
for all types particles remains consistently above 99.5% with- 
out background, particularly for single ~~, and the track effi- 
ciency is approximately 100%. Further, when a particle goes 
through the DC, only the energy loss of the particle is con- 
sidered. If other interactions occur in the DC material and 
new types of particles are generated, these events are not 
considered in the tracking efficiency calculations. Thus, no 
distinction is made between particles and antiparticles in this 
study. The evaluation of track reconstruction with the back- 
ground is important, as the track reconstruction of a real de- 
tector is susceptible to the background from different sources, 
significantly impacting the accuracy of the reconstruction re- 
sult. To assess the performance of the tracking algorithm 
with the background and measure its ability to remove noise, 
the tracking efficiency is tested at different background lev- 
els. Figure 15 presents the track efficiency as a function 
of pr for single u`, both without background and with a 
noise level of 20%. As evident, track efficiency is not sig- 
nificantly affected by the background, remaining consistently 
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above 99.8%. This indicated the good stability and robustness 
of the tracking algorithm. The track multiplicity is a critical 
and significant issue, and preliminary research has been con- 
ducted. The generated MC events exhibit an average of 10 
tracks with an average momentum of 2 GeV with a tracking 
efficiency of 99.6%. This suggests that the current track re- 
construction algorithm can handle multiple tracks effectively 
and achieve good performance. 


E. Physics event reconstruction 


To verify the accuracy of the tracking algorithm in physics 
events, ete7 — ZH, H — u* u` process is analyzed. Fig- 
ure 16 shows the reconstructed mass distribution of the Higgs 
boson fitted to the CrystalBall function. The mean value is 
measured at 125 GeV and aligned with the expected invariant 
mass of the Higgs boson. Further, the standard deviation is 
measured at 212.5 MeV. 

Based on the aforementioned results and performance eval- 
uations, it can be concluded that the tracking algorithm oper- 
ates effectively and satisfies the performance requirements of 
the CEPC project. 


VI. CONCLUSION 


This study proposes the design and implementation of the 
DC software for CEPC, comprising four building blocks: 
event generation, DC simulation, digitization, and track re- 
construction. The software is developed using C++ and 
Python based on the Gaudi framework and relies on several 
external libraries, including Geant4, CLHEP, ROOT, BOOST, 
GenFit, EDM4hep, DD4hep, and Key4hep. A modular de- 
sign style is adopted based on Gaudi dynamically loadable 
elements, which enhances the maintainability, extensibility, 
and reliability. 

In the DC simulation, the DC geometry is constructed, 
including layers with stereo wires, materials, and magnetic 
field, and DDSegmentation is used to divide the virtual drift 
cells, thereby reducing runtime and saving computational re- 
sources. Digitization is implemented to simulate the elec- 
tronic readout, considering a simple uniformly random back- 
ground. This renders the simulation closer to real data, and 
provides a large amount of simulated data for reconstruction 
and physics research. 

The tracking algorithm is implemented using four main 
processes: finding DC hits based on the CKF, salvaging DC 
hits lost by the CKF, appending the SET hit, and fitting the 
track using GenFit. First, DC hits are determined using CKF 
based on the seed tracks provided by the silicon detector. The 
CKF from Belle II is reused with the conversion of the data 
model, expansion of the geometry interface, and optimization 
of the configuration parameters, resulting in the overall best 
tracking performance. Second, an algorithm for salvaging 
DC hits is independently developed using GenFit for extrap- 
olation, serving as a supplement to the hit-finding algorithm, 
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Fig. 13. Resolution of do (left panels), zo (middle panels) and relative resolution of pr (right panels) for single p~ as function of particle pr 


(top panels) and polar angle |cos6| (bottom panels) respectively. 
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Fig. 14. Tracking efficiency as a function of pr measured using 
single particles, including e`, 4, m~, K`, p, without background. 


thereby significantly improving hit efficiency and track qual- 


67 ity. A SET hit is then added to the track to improve the ac- 
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curacy of the tracking algorithm. Finally, the tracking toolkit 
GenFit for track fitting is adopted at CEPC based on a 55- 
layers drift chamber, and for the first time, a detailed intro- 
duction and application in high-energy physics experiments 
are demonstrated. 

The performance of the tracking algorithm has been stud- 
ied in addition to Geant4-based full simulation of particle in- 
teractions with the detectors and H —> u*u events. The 
fitted tracks are successfully fitted and converged, with a fit- 
ting success rate of 99.8%. The time required to reconstruct 
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Fig. 15. Track efficiency for single u~ as a function of particle pr. 
The light blue dots represent the results without background, and the 
light green dots represent the results with a noise level of 20%. For 
each pr, a sample of 5k events is used for the study. 


a single u` event with 2 GeV, including track finding and 
track fitting, has an average of 0.294 s. This number is reason- 
able. The tracking efficiency is above 99.55% for five types 
of single particles (e~, u~, m~, K7, p) and for single u~ 
remains above 99.80% with a noise level of 20%, which is 
consistent with the case without background. The track ef- 
ficiency of 99.6% for multiple tracks events generated by 2 
GeV w`, illustrates the track reconstruction algorithm can 
handle multi-tracks and exhibits good performance. When 
the momentum is 10 GeV and |cos@| = 0.09, op, /pr is less 
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Fig. 16. The reconstructed dimuon mass distributions from the H —> 
pt u events. 


[1] G. Aad, T. Abajyan, B. Abbott, and et al (ATLAS), Phys. Lett. 
B 716, 1 (2012), comments: 24 pages plus author list (38 pages 
total), 12 figures, 7 tables, revised author list, 1207.7214, URL 
https: //cds.cern.ch/record/1471031. 

[2] S. Chatrchyan, V. Khachatryan, A. Sirunyan, and et al, 
Physics Letters B 716, 30 (2012), ISSN 0370-2693, 
URL https://www.sciencedirect.com/science/ 
article/pii/S0370269312008581. 

[3] L. Evans and P. Bryant, Journal of Instrumentation 3, 
S08001 (2008), URL https: //dx.doi.org/10.1088/ 
1748-0221/3/08/S08001. 

[4] S. F. Novaes, Standard model: An introduction (2000), hep- 
ph/0001283. 

[5] T. A. collaboration, Tech. Rep., CERN, Geneva (2013), URL 
https://cds.cern.ch/record/1611190. 

[6] H. Cheng, W. H. Chiu, Y. Fang, Y. Gao, J. Gu, G. Li, T. Li, 

Z. Liang, B. Liu, J. Liu, et al., The physics potential of the 

cepc. prepared for the us snowmass community planning exer- 

cise (snowmass 2021) (2022), 2205.08553. 

M. Koratzinos et al., in Proc. of International Particle Ac- 

celerator Conference (IPAC’17), Copenhagen, Denmark, 19 

May, 2017 (JACoW, Geneva, Switzerland, 2017), no. 8 in 

International Particle Accelerator Conference, pp. 3003-3006, 

ISBN 978-3-95450-182-3, https://doi.org/10.18429/JACoW- 

IPAC2017-WEPIK034, URL http://jacow.org/ 

ipac2017/papers/wepik034.pdf. 

[8] T. C. S. Group, Cepc conceptual design report: Volume 2 - 
physics & detector (2018), 1811.10545. 

[9] A. Djouadi, J. Lykken, K. Mönig, Y. Okada, M. Oreglia, and 
S. Yamashita, International linear collider reference design re- 
port volume 2: Physics at the ilc (2007), 0709.1893. 

[10] CERN, Cern yellow reports: Monographs, vol2 (2018): The 
compact linear e+e- collider (clic) : 2018 summary re- 
port (1970), URL https: //e-publishing.cern.ch/ 
index.php/CYRM/issue/view/ 66. 

[11] Z-X. Chen, Y. Yang, M.-Q. Ruan, D.-Y. Wang, G. Li, 
S. Jin, and Y. Ban, Chinese Physics C 41, 023003 (2017), 
ISSN 1674-1137, URL http: //dx.doi.org/10.1088/ 
1674-1137/41/2/023003. 

[12] T. Han, Z. Liu, and J. Sayre, Phys. Rev. D 89, 
113006 (2014), URL https://link.aps.org/doi/ 
10.1103/PhysRevD.89.113006. 


[7 


(S; 


708 
709 
710 
71 


712 


713 


714 


715 
716 
717 
718 
719 
720 
72 


722 


765 
766 
767 
768 
769 
770 
771 
772 
773 
774 
775 
776 
777 
778 
779 
780 
781 
782 
783 
784 
785 
786 
787 
788 
789 
790 
791 
792 
793 
794 
795 
796 
797 
798 
799 
800 
801 
802 
803 
804 
805 
806 


12 


than 0.14% which satisfies the performance requirements of 
CEPC. The reconstructed dimuon mass distributions from the 
H — u* p” events are reasonable, with a sigma of 0.21 GeV. 
The results demonstrate that the tracking algorithm exhibits 
an excellent performance, satisfies the performance require- 
ments of the CEPC tracking system, and can be utilized for 
further research. 
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lation of detector response in the DC, and focus on the track- 
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beam-induced backgrounds. Moreover, further optimization 
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struction algorithm. Furthermore, a seed-finding algorithm 
will be developed to provide seed tracks for DC track recon- 
struction. 
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